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ABSTRACT 

Black students from two cities and Mexican- American 
students, both male and female, showed small but consistent 
tendencies to perform better, relative to White groups, on three 
nontradit ional measures; tests of inductive reasoning, spatial 
scanning, and associative memory. These measures showed somewhat less 
discrepancy between the groups than did tests of the more traditional 
verbal and mathematical aptitudes. The results seemed compatible with 
the suggestions of the Commission on Tests to expand the number of 
measures included in traditional testing programs. Other measures of 
these same aptitudes, and their validities, remain to be explored in 
this contexts 
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PROJECT ACCESS RESEARCH REPORT #3 
Minority Versus Majority G-roup Performance on 
an Aptitude Te; t Battery 
Ronald L. Flaugher 



Abstract 

Black students from two cities and Mexican-American students, both 
male and female, showed small but consistent tendencies to perform better, 
relative to White groups, on three nontraditional measures: tests of 

inductive reasoning, spatial scanning, and associative memory- These 
measures showed somewhat less discrepancy between the groups than did 
tests of the more traditional verbal and mathematical aptitudes. 

The results seemed compatible with the suggestions of the Commission 
on Tests to expand the number of measures included in traditional testing 
programs. Other measures of these same aptitudes, and their validities, 
remain to be explored in this context. 
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PROJECT ACCESS RESEARCH REPORT #3 

Minority Versus Majority Group Performance on 

1 

an Aptitude Test Battery 
Ronald L . Flaugher 

The content and structure of large-scale assessment programs have been 
determined , like many other aspects of our society, by the services and 
benefits that accrue to the white majority population of the United States. 

It is frequently the case that the decisions which are made to maximize 
this benefit either ignore or actually conflict with the wishes and needs 
of ethnic minority groups. The purpose of the present study was to examine 
a multimeasure aptitude test battery for suggestions of revisions that would 
counteract this effect . 

In an earlier report (.Flaugher, 1971) data from this same test battery 
were compared to those from previous studies by Lesser, Fifer, and Clark 
(1965) and Stodolsky and Lesser ( 1967 ). In those studies, each ethnic group 
was treated as if it were equal in size to the other groups, which yielded 
interesting results, but was not representative of the numerical imbalances 
in the society. For this reason, the present analysis was conducted, in 
which each minority group was compared individually with data, on the available 
samples of the white population. It is the differences in performance which 



"^Project Access Research Reports: #1, Reactions to a very difficult 

test by an inner-city high school population: A test and item analysis, by 

Ronald L. Flaugher and Lewis W. Pike, College Board Research and Development 
Reports 69 - 7 O, No. A, and ETS Research Memorandum 70-11, June 1970; #2, 

Patterns of test performance by high school students of four ethnic identities, 
by Ronald L. Flaugher, College Board Research and Development Reports 70-71^ 

No. 9, and ETS Research Bulletin 71-25, May 1971* 
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appear between any particular minority group and the majority that will 
likely be the significant ones for that minority and that will have the 
greatest implications for the appropriateness of particular assessment 
techniques . 

Project Access , through which these data were collected, and the 
specific content of the test battery are described in detail in Appendix I. 

Procedure 

The analysis utilized here is a duplication of that employed in the 
previous study of these data (Flaugher, 197 l ) > with the exception that just 
two rather than four groups are included, such that each minority group is 
compared only with the majority group. This method adjusts the two group 
means as if the average of those two means were 50 and the total standard 
deviation were 10 . The advantage of this approach is that it permits a 
graphical presentation of group mean differences on a variety of tests- n 
presentation which is independent ^ _aes , and thus permits 

comparison of relative performance from test to test. The groups are once 
again treated as if they were of equal size, but the single comparisons of 
each minority group wi^h the majority group is a more meaningful treatment 
than in the previous study. 

Da^a from two cities were available for this analysis: Memphis* 

Tennessee and Los Ang les , California. Analyses were conducted separately 
by sex, and in each case the comparisons were made with the White group of 
the appropriate sex. Three minority ethnic groups were available in L )S 
Angeles: Black, Mexican- American, and Oriental; in Memphis, just Black 

students were available in sufficient numbers to be compared meaningfully 
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to the White students of that city. This resulted, then, in eight indepen- 
dent comparisons with the appropriate White samples: (l) Black Los Angeles 

males, (2) Black Los Angeles females, (3) Black Memphis males, (4) Black 
Memphis females, (3) Mexican- American Los Angeles males, (6) Mex.i can-American 
Los Angeles females, (7) Oriental Los Angeles males, and (8) Oriental Los 
Angeles females - 

The sizes of the groups utilized in this analysis are presented in 
Table 1. 



Results 

The data from the nine tests in the battery are presented here, grouped 
to conform to the presentation in the previous studies. Four of the tests, 
those which best represent the four aptitudes of Verbal, Reasoning, Mathe- 
matics, an" Space, are presented together and in that order; the results 
from the remaining five tests are presented separately. The numerical values 
for all of the following graphs are presented in tabular form in Appendix II. 

Figure 1 presents the comparison of Black students with White students 
on the first four tests, for both male and female students, in two cities. 
Thus, there are four independent comparisons of test performance represented 

on each of the four kinds of test material. 

In order to assure proper interpretation of this graph and the ones 
to follow, it should be understood that there is a "mirror image” nature 
about the values represented in the graphed lines. That is, each pair of 
points indicating mean values on a particular test (e.g., Black and White 
Males, or Black and White Females) must necessarily have an average, 
between the two of them, of 50, by virtue of the conversion technique. 
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Table 1 

Options and Responses for Students on the 
Item "How bo You Describe Yourself?" 





Male 


Female 


Total a 


Los Angeles 


Blach, Afro -American ^ Negro 


1211 


1923 


3164 


White 


155 


151 


312 


Mexican-American, Brown 


51 * 


562 


1094 


Oriental 


207 


200 


4 ll 


Memphis 


Afro-American, Negro 


1120 


1.61k 


£773 


White 


8 64 


950 


1844 



a Total equals number of males plus number of females plus 
number of students who did not indicate sex. A total of 196 
students indicated ethnic categories other than those shown here ; 
and 286l students chose either to omit this item or to give 
multiple responses; these were not included. 
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In other words , In Figure 1 the distance above the midpoint of 50 for the 
White group is necessarily matched by the distance below that midpoint 
for the Black groups . 

With this understanding, then, interpretations can be made of the 
graphs. It can be seen that there are both similarities and differences 
between the left and right sides of the figure. The similarities are that 
the two graphs are rather more "flattened" than was the case when four ethnic 
groups, rather than two, were included (see Flaugher, 197l)> but the present 
two graphs show a trend toward less discrepancy on the reasoning and spatial 
tests than on the verbal and mathematical tests . 

The main difference between the two graphs is perhaps the wider 
separation between the Black and White groups from Memphis than those from 
Los Angeles. Rather than indicating that the respective students are some- 
how "better" or "worse" in one city than in another, it should be noted 
that this difference in separation is almost certainly due to the kind of 
selection for participation in Project Access that occurred in the two 
cities. It has been well documented that socioeconomic status is related 
to general level of test performance, and the students who participated 
in Project Access from the two cities differed a great deal on that variable. 
In Los Angeles, a selected group of inner city schools, characterized by low 
socioeconomic status, were participants in the study. In Memphis, however, 
virtually the entire school system participated, including a large number 
of White students from high socioeconomic backgrounds. The effect on the 
test scores was to create a wider separation between groups in Memphis than 
in Los Angeles, as shown in Figure 1. 
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x„ Figure 2, the data are presented fro*, the sample of Mexiean-American 

Students in Los Armies. On the deft sf.de of the Figure is the graphing of 

• 1-4- ~ -u v, ^ t psults from tbe additional 

the first four tests, and to the rig ar 

five tests. Within the graph of the First Four tests, once again the two 
least discrepant tests are the reasoning and the spatial tests for the 
comparison of the male groups; For the Female groups the reasonrng tes 
does not show the lessened discrepancy hut the spatial test does, whrch 
is similar to the pattern For the Blach Females of Los Angeles shown an 

Figure 1. 0 

The additional Fite tests, graphed in the right side of Frgure a, 

sh ow the Mexican- American students, both male and female, doing -latswelF 

. , and Picture Humber (associative memory) and, 

better on Figure Analogies ana ncou. 

males only, on Sentences. 

in Figure 3, the results from both Memphis and Los Angeles Blac 
students are presented For the additional Five tests. This combining of 
results from the two cities into one graph was possible because o e 

differences between the cities in the socioeconomic level of the partrcr 
pants, as mentioned earlier, and the resultant differences in mean scores 

for any of the tests, except a dramatic reduction in differences for the 

_ j. -f-Vw^TR are similar ires u. XX s 

Picture Number test. In the Los Angeles data there 

. _ nd there is a smaller difference for the Blac 

for the Picture Number test, and there 

males on Reading and Sentences. 

Finally, for completeness, the Oriental-with-White comparisons are 
g raphed in Figure The interpretation of these differences are a 

different matter, since it is the Oriental students who obtained hrgher 
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Figure 3, Contrasting patterns for five "additional" tests, by sex, between 
Black and White students from two cities. 
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scores, and consequently their graphed lines appear above the midline and 
the White students r lines appear below. It happens that there is some 
evidence from the background question'- ire data that the Oriental students 
were of a higher socioeconomic level than the White stude, the same 

schools , and this might well be the explanation of these qu: " di fere t 

results . ^ For the purposes of the present discussion, howev L.-e da - a are 
presented here only for completeness and will not be included — - he i -.er 
pretations of the value of particular types of aptitude test. 

Discussion 



The differences from test to test in Figures 1* 2, and not ~arge 

for the most part,, but there is a certain amount of consistency in the 
results that would seem to deserve attention. For example, the comparison 
of Black and White students in Figure 1 shows that the Letter Groups and 
the Choosing A Path tests reduce the discrepancy between the races slightly 
over the discrepancy found in the Vocabulary and the Mathematics tests. In 
Figure 2, showing results for the Mexican -American students, the same trend 
is again present, but is small. However, the fact that the trend did occur 
in most of the situations where it was possible would indicate that some 
nonchance influence is at work and that these differences deserve further 
consideration. The same may be said of the consistent and sharp reduction 
in differences on the Picture Number test, as shown in Figures 2 and 3- 
(For a discussion of the influence of test reliabilities on these differences 

see Appendix III.) 
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‘The same caveat- applies, of course, in the interpretation of any of the 
mean differences between any of the groups. That is, the ?,ES :an be expected 
to influence the level of test scores, but not the relativ' raniing of the 
tests within a group (Flaugher, 1971)* 
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Besides the consistency; another justification for the interpretation 
of these rather small differences between the relative test scores is the 
potential importance of even slight alterations in test performance by a 
large group of students. In national testing programs, because of the 
very large numbers of students involved, even slight differences in test 
score can be of consequence to a great many students, particular a_y in 
settings of competition for postsecondary educational opportunities. Bor 
example, on the basis of these results, the consequences of utilizing 
Letter Groups in a testing program, either in addition to the traditional 
verbal and math measures or as a substitute for one of them, would be 
higher scores for many Black and Mexican-American students . 

It is likely that for any given subgroup of the population, a partic- 
ular test could be designed on which that subgroup would score higher than 
any other. The viability and the usefulness of such a measure would remain 
to be proved; however. Such was not the case with the measures employed in 
the present study, in that each measure is either an example of a well- 
documented cognitive factor or an established measure of academic aptitude. 

Questions of current validity are of interest, of course, and such 
information is presently being sought for these same students. But docu- 
mentation of reduced discrepancies in test performance would seem justified 
in itself, since it reveals the existence of aptitudes which, because they 
are not typically documented in the traditional test batteries, are not 
acknowledged to exist in any great quantity. If they are not acknowledged, 
it is unlikely that they are being utilized in the traditional processes of 
education, and in such a situation, calculated validities would necessarily 
be low. Rather than reject the use of those particular aptitude measures. 
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however, it would seem more appropriate to consider the alteration of the 
curriculum to utilize more fully those talents whose existence is documented. 
In such an ideal circumstance, validity coefficients might then be considered 
an index of the success of the curriculum, rather than a measure of the worth 
of a particular aptitude measure. 

The data presented here acquire additional significance when considered 
in the light of the recent "Report of the Commission on Tests (College 
Board, 1970 ), whose charge was tc review the testing functions of the College 
Entrance Examination Board and to consider possibilities for fundamental 
changes in tests and their use. In particular, John Carroll suggested an 
expansion of the number of component parts of the SAT to four or five 
reported scores. He suggested separation of the measured abilities into 
purely verbal, purely mathematical, and an isolation of the reasoning com- 
ponent from both the verbal or mathematical components. In discussing the 
separation of reasoning from the verbal scores, he stated that "presumably, 
the verbal scores would be largely a function of the individual's education 
and general reading experience, while the reasoning score would be less 
influenced by these factors and be more predictive of success for individuals 

with educational disadvantages" (p. 5)« 

Certainly a comparable case could be made to justify the separation 
of the mathematics score from the reasoning score. Mathematical test per- 
formance is even more clearly a function of the amount and quality of the 
formal education that the student has been exposed to, and there is ample 
evidence that these minority groups have not received a high quality 
education. Therefore, to attempt to judge such a student s reasoning 
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capacity by examining • score on a mathematics test would seem to be 
unwise and perhaps quite uniair in effect. 

The results of the present study, then, seem compatible with the 
suggestions of the Commission on Tests to expand the number of measures 
included in traditional testing programs. Further, if 1 it were necessary 
to choose the particular measures that should be included, then the 
measures of inductive reasoning, spatial aptitude, and associative 
memory could be recommended# Existing measures of other cognitive 
factors need to be explored, however, as do other measures of these 
same factors . 
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APPENDIX I 

Description of Project Access and Its Test Battery 

Project Access is a program sponsored by the College Entrance Examina- 
tion Board and has as its goal the providing of access to post-secondary 
education for minority /poverty students, while applying the knowledge gained 
in this endeavor to other testing and measurement activities that affect 
these students. In the 1969—70 school year, a nine— test Project Access 
battery was administered, together with a Biographical Inventory, to approxi- 
mately 18,000 junior-year students in three cities: Los Angeles, Memphis, 

and Washington, D. C. Not all schools in each city participated. They were 
chosen within the city on the basis of interest and willingness to participate. 

The nine aptitude tests included in the battery were as follows: 

Vocabulary , 30 synonym items, 15 minutes. Knowledge of word meaning is 
measured . 

Letter Groups , 25 items, 15 minutes. A measure of inductive reasoning. 

Five sets of four letters each are presented, the task being to find the rule 
which relates four of the sets to each other and to mark the one which does 
not fit the rule. 

Mathematics ^ 50 items, 30 minutes. The quantitative comparison type of 
item was employed, involving the comparing of pairs of mathematical quantities 
to determine equality, or direction of inequality. 

Choosing A Path ^l8 items, 10 minutes. Spatial scanning is measured. 

Each item consists of a network of lines as in an electrical— circuit diagram 
having many intersecting and intermeshed wires with several sets of terminals. 
The task is to trace the lines and to determine for which pair of terminals 
there is a complete circuit. There is some orderliness in the layout to 
encourage comprehension of the pattern by scanning rather than by simple 

visual pursuit of lines. 
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Reading, 35 items, 25 minutes 

mutes . Reading comprehension items on passages 

from short stories and articles. 

Sentences , 40 items, 20 minutes. Detection of 
, . Detection of errors in written passages 

solving language, punctuation, and capitalization. 

Year 2000. ?n ^ 

items, 10 minutes. Abllitv t-o 

y to follow complex directions 

demonstrated by finding certain ^ „ . calend „ £ „ ^ y _ 200Q _ ’ 

^affiS-Analogies, 25 items, 10 minutes. Analogy items composed of small 
geometric designs and figures. 

Plctur e-Numb er . IS it-omm t_ 

ems m each of two parts, 10 minutes. A two-part 

measure of associative memory between drawings and two H • • 

wmgs and two-digit numbers. For 

each part, three minutes are allowed for study of the pairs th t- 

pairs, then two minutes 

or recalling the numbers „ hen presented with the drawings. 

In addition to the tests, a 15-minute biographical inventory asked infer- 
mation about ethnic iHpnt-it 

7, sex, aspirations, school grades, interests, 

and socioeconomic status. 
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VOCABULARY LETTER GROUPS MATHEMATICS 



APPENDIX II 



Means and. Sigmas of Raw Scores on Each Test, Converted 
Scores, and Majority-Minority Differences 
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APPENDIX III 



Test Reliabilities and Their Influence on 
Mean Differences 



If the two "tradi tional" tests. Vocabulary and Mathematics, have 
higher reliabilities than the "new" tests. Letter Groups, Choosing A 
Path, and Picture-Number, then there exists the possibility that the 
lesser mean differences between the groups on the new tests are due to 
this, rather than because of some qualitative differences in the tests. 

Such an explanation assumes that reduced reliability would act to reduce 
mean differences between the groups . 

The reliabilities of seven of the nine tests were estimated by 
Dressel’s modification of the Kuder— Richardson formula 20, which takes 
formula scoring into account (Dressel, 1940). The median obtained 
reliability estimates for the first four tests were: Vocabulary, .83; 

Letter Groups, .85; Mathematics, .91; and Choosing A Path, .74, Thus it 
appears that such an explanation is possible, but by no means a certainty, 
for the Choosing A Path results, since it has the lowest reliability of 
the four; such an explanation is not appropriate, however, for the Letter 
Groups results. As for the third "new" test with noticeable and consistent 
reductions in discrepancy, namely Picture-Number, the median reliability 
estimate was .93 and thus unlikely to be a factor in the results. 
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